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METHOD AND SYSTEM FOR GENERATING HELP FILES 
BASED ON USER QUERIES 

TECHNICAL FIELD 

[0001] The described technology relates to generating help files for use in 

providing help information to users of computer systems. 

BACKGROUND 

[0002] Many applications (i.e., computer programs) provide user interfaces 

through which a user can interact with the application. The user interfaces may 
be very sophisticated and provide the user with hundreds of different features. A 
user who is not very experienced with an application may not know of the 
existence of certain features or may not know how to use certain features. To 
assist such users with the user interface, applications typically provide a help 
system through which a user can request help information describing the various 
features. The help information for an application is typically stored in a help file. 
A help file has an entry for each topic of help information that is available. For 
example, a word processing application that provides a feature to automatically 
correct misspellings, an "AutoCorrect" feature, may have a topic describing the 
feature. Each entry may include the title or name of the topic and content that 
describes the topic in some detail. Figure 1 is a diagram that illustrates the 
structure of an example help file. In this example, the help file includes a content 
file 101 and an index 102. The content file contains the entries of the help file, 
and the index provides a mapping from various keywords (including phrases) to 
the related topics in the content file. In this example, the keywords "AutoCorrect" 
and "Spelling Correction" are mapped to the entry for the "AutoCorrect" topic. A 
keyword may be mapped to multiple topics. For example, the "AutoCorrect" 
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keyword may be also mapped to a topic explaining how to turn off the 
"AutoCorrect" feature. 

[0003] Figures 2A and 2B illustrate typical use of a help system. When a user 

requests help, the help system displays a dialog box 201. The dialog box 
includes an input field 202 for a user to enter a keyword of interest. In this 
example, the user has entered the keyword "spelling." The help system identifies 
all the keywords in the index of the help file that begin with the word "spelling." 
The help system then displays those keywords in a keyword display field 203. 
When a user selects one of the displayed keywords, the help system identifies the 
topics that are mapped to be selected keyword using the index and, if more than 
one topic is identified, displays the titles of those topics to the user. The user can 
then select a topic of interest. When a topic is selected, the help system retrieves 
and displays the content for the selected topic as shown in dialog box 204. 

[0004] Typical help systems also allow users to enter queries or questions, rather 

than simply selecting keywords that happen to be indexed. When a user submits 
a query, the help system may search the content file to identify topics that match 
the user query. (A help file may include a search catalog that the help system 
searches, rather than searching the entire content file. The search catalog maybe 
generated from the content file and is used to improve the speed of searching.) 
The help system then displays the titles of those matching topics to the user so 
that the user can selectively view the content of the topics that the user thinks 
may be relevant. Figure 3 illustrates a dialog box for entry of a user query. 
Dialog box 301 includes an input field 302 for a user to enter their query. The 
query may be in the form of a question or may be a phrase or words of interest. In 
this example, the user entered the question "How do I stop my name from being 
changed?" This question might be asked, for example, if the user's name is "Juli" 
and the "AutoCorrect" feature is configured to automatically "correct" that word to 
"July" on the assumption that the user simply misspelled the month. After a query 
is entered, the help system identifies the topics that best match, that are most 
relevant to, the query. Help systems may use various well-known search 
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algorithms for identifying the matching topics. The help system then displays the 
matching topics in a display field 303. The user can then select one of the 
displayed topics to view its content. 
[0005] Unfortunately, many of the help systems often do not locate any topics that 

the user finds relevant. In the example of Figure 3, the help system determined 
that the topics "field names" and "stop words" best match the user query. The 
help system, however, did not identify the "AutoCorrect" topic as matching 
because none of the non-noise words (e.g., "stop," "name," and "changed") are 
contained in the content of the "AutoCorrect" topic. In such a situation, the user 
may not even be aware of the "AutoCorrect" feature and thus may not be able to 
formulate a query that will identify that topic. The user is only provided with topics 
that are not relevant to the user's query and has no effective way to identify the 
relevant topic. 

[0006] When help files are developed, the help file developers (e.g., technical 

writers) spend considerable effort in trying to identify topics, content, and indexes 
that the users will find helpful. As part of their efforts, developers may talk to 
various users, review newsgroup postings related to the application, talk to the 
application programmers, and so on when creating a help file. If a developer 
knew that users often ask the question of Figure 3, then the developer might 
create a new topic relating to changing a user's name or might add keywords, 
such as "stop name changes," to the index that maps to the "AutoCorrect" topic. 
Such ad-hoc identification of topics and keywords is very expensive and does not 
result in relevant topics being identified for many user queries. 

[0007] The expense of developing help files increases significantly when the help 

files need to be localized (e.g., translated from an original language into a local 
language). Such localization typically involves more than just straightfonvard 
translation of the words of the help file. For example, it may be difficult to find 
keywords in the local language that accurately convey the meaning of keywords in 
the original language. In particular, a localization may involve extensive rewriting 
of the content and identification of new keywords. 
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[0008] It would be desirable to have a way to generate help files that would allow 

a help system to identify topics that users find relevant to their queries. In 
addition, it would be desirable to have a technique for the localization of help files 
that would allow for more efficient and accurate localization. 

SUMMARY 

[0009] A help file generation system generates a help file based on failed user 

queries, that is, user queries that identified no relevant topics. The generation 
system collects the failed user queries and provides them to a help file developer. 
The developer can identify the topic(s) that are relevant to a failed user query. 
The generation system then updates entries for those topics by adding data 
derived from the failed user queries (e.g., the queries themselves) to the entries. 
When a help system uses the updated help file, it factors in the added data in 
determining whether an entry matches a user query. Thus, when a user again 
submits a failed user query, the help system will likely match it to a topic that has 
data derived from a similar failed user query. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram that illustrates the structure of an example help file. 
Figures 2A and 2B illustrate typical use of a help system. 
Figure 3 illustrates a dialog box for entry of a user query. 
Figure 4 is a diagram illustrating a dialog box presented by a help system 
in one embodiment. 

Figure 5 is a display page illustrating the user interface of the help file 
generation system in one embodiment. 

Figure 6 is a block diagram illustrating a help file that has been modified to 
include user query metadata. 

Figure 7 is a block diagram illustrating components of the help file 
generation system in one embodiment. 
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[0017] Figure 8 is a flow diagram illustrating the processing of the help file 

generation system when It receives a failed user query in one embodiment. 

[0018] Figure 9 is a flow diagram illustrating the processing of the link queries 

component in one embodiment. 

[0019] Figure 10 is a flow diagram illustrating the processing of the update user 

queries component in one embodiment. 

DETAILED DESCRIPTION 

[0020] A method and system for generating a help file based on failed user 

queries is provided. In one embodiment, a help file generation system collects 
failed user queries from user computer systems. A "failed user query" is a query 
that has been classified as having "failed" for some reason. For example, a query 
may be considered to have failed if a user indicates that none of the topics 
identified by the query appears to be relevant. A help system may establish that a 
user query failed by prompting the user to indicate whether the identified topics 
were relevant to their query. A query may also be considered to have failed, for 
example, if a user does not view any of the content associated with the identified 
topics. When a user query fails, then the help system may notify the help file 
generation system of the failed user query. The generation system stores the 
failed user queries for use in assisting a developer In developing a help file that 
will result In relevant topics being Identified when similar user queries are 
submitted. The generation system assists a developer In identifying failed user 
queries that are related to a topic, that is, the topic is relevant to the failed user 
query. The generation system then updates the entries in the help file for the 
relevant topics based on the identified user queries. For example, a developer 
may identify that the query "How do I stop my name from being changed?" Is 
related to the "AutoCorrect" topic. Based on that Identification, the generation 
system adds data to the entry for the "AutoCorrect" topic that is derived from the 
identified user query. For example, the generation system may add the user 
query Itself as metadata of the entry. When a help system subsequently uses that 
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help file, It will check the metadata of the entry when searching for a match. If the 
identified user query is again submitted, the help system will likely identify that the 
"AutoCorrect" topic matches because of the similarity between the metadata and 
the submitted user query. Because a developer can effectively link failed user 
queries to their relevant topics, the help file can be adapted to the actual queries 
that users submit. 

[0021] The help file generation system can also be used to assist a developer in 

identifying new topics or new content for the help file. For example, a developer 
may determine from the failed user queries that many different users are 
submitting the query "How do I stop my name from being changed?" or similar 
queries. Although the developers could link the failed user queries to the 
"AutoCorrect" topic, the developer could alternatively create a new topic named 
"Stop changing my name." The content of the topic can be specifically tailored to 
the name changing problem, rather than providing a general description of the 
"AutoCorrect" feature. The developer could also decide to update the content of 
the "AutoCorrect" topic to describe the name changing problem or even change 
the title of the entry. A help system may be able to identify the new topic or the 
"AutoCorrect" topic with its updated content to the same user query, based on 
matching of the title and content of the entries alone. A developer could also link 
the failed user query to the topics so that the metadata derived from the failed 
user query can also be used to match topics. 

too22] In one embodiment, the help file generation system can be used to more 

effectively localize help files. Once an original help file is developed in an original 
language, that help file can be provided to the localizers. After generating the 
localized content, a localizer can use the help file generation system to identify 
failed user queries that are related to various topics. The localizer can then 
translate those user queries and link them to the related topics. The generation 
system then adds metadata to the entries of the help file based on the translated 
user queries. Although the localizer could simply translate the user queries of the 
original help file, some of the user queries may be difficult to translate accurately. 
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By allowing the localizer to identify the user queries that should be linked, the 
localizer can select user queries that can be more quickly and accurately 
translated. In addition, if user queries can be collected in the local language, then 
the localizer can choose from those queries and avoid translations of the user 
queries. 

[0023] Figure 4 is a diagram illustrating a dialog box presented by a help system 

in one embodiment. The dialog box 401 includes an input field 402, a display 
field 403, and a button 404. A user enters their query into the input field, and the 
local help system executing on the user's computer displays the matching topics 
in the display field. The local help system allows the user to indicate that none of 
the topics are relevant by selecting the button. When a user selects the button, 
the local help system may connect the user to a web-based help system. The 
web-based help system may provide more comprehensive help information to the 
user. When connecting, the local help system can provide the failed user query 
and the identification of the application to the web-based help system. The web- 
based help system can store the failed user query and application identification 
for later use by the help file generation system. Alternatively, the local help 
system can send the failed user query and application identification to the help file 
generation system directly without connecting to a web-based help system. 

[0024] Figure 5 is a display page illustrating a user interface of the help file 

generation system in one embodiment. The display page 500 includes a topic 
area 501 , a search area 502, a user queries area 503, and a linked queries area 
504. The display page also includes a find user queries button 505, an add 
button 506, a delete button 507, and a done button 508. Prior to the display of 
this display page, the developer selected a topic to which failed user queries are 
to be linked. In this example, the developer selected the "AutoCorrect" topic. The 
developer enters a search criteria in the search area to select a subset of the 
failed user queries that may be related to the selected topic. In this example, the 
user entered the search criteria "stop changing names." After entering the search 
criteria, the developer selected the find user queries button to direct the 
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generation system to identify the failed user queries that match the search criteria. 
The generation system displays the matching user queries in the user queries 
area. The linked queries area shows those queries that the developer has 
already indicated should be linked to the selected topic. The user can link 
additional user queries by selecting a user query or group of user queries from 
the user queries area and selecting the add button. Similarly, a user can unlink a 
linked user query by selecting the user query from the linked queries area and 
selecting the delete button. When the user is done linking user queries to the 
selected topic, the user selects the done button. In response, the generation 
system updates the entry in the help file of the selected topic based on the linked 
queries - although the updating could be deferred until later when all the entries 
are updated at the same time. 
[0025] Although not shown in Figure 5, the help file generation system may also 

allow the developer to specify the application name as an additional search 
criterion. When an application name is specified, the generation system only 
searches for failed user queries reported by the help system of that application. 
Some help topics may be common to several different applications. When linking 
failed user queries to such a topic, the developer may want to see the user 
queries for all those applications. In contrast, when a topic is unique to an 
application, the developer may want to see the user queries for only that 
application. 

[0026] Figure 6 is a block diagram illustrating a help file that has been updated 

based on failed user queries. The content 601 has been modified to include the 
linked user queries as metadata. The metadata may be specified in various ways, 
such as using a separate tag for each query or grouping all the user queries 
within a single tag. In this example, the linked user queries are grouped in a 
single "<query>" tag. In addition, the generation system may add metadata 
derived from the user queries, rather than adding the user queries themselves. 
For example, if the linked user query is "How do I stop my name from being 
changed?" then the generation system might simply add "stop changing my name" 

[41826-8005-USOOOO/SL040360.282] -8- 2/17/04 



as the metadata. The index 602 may have the same keywords as an index 
without user query metadata or may be adapted based on the linked user queries. 
One skilled in the art will appreciate that when the help file includes a search 
catalog, the search catalog can be generated based on the linked user queries in 
addition to the content. In such a case, the linked user queries would not need to 
be stored as metadata along with the content. 
[0027] Figure 7 is a block diagram illustrating components of the help file 

generation system in one embodiment. The help file generation system 710 is 
connected to user computers 701 via the communications link 702. Each user 
computer may have applications that use a local help system to display help 
information derived from a help file. When the local help system detects that a 
user has indicated that a user query did not identify any relevant topics, then the 
local help system sends a message to the generation system via the 
communications link that includes the failed user query and identifies the 
application. The help file generation system includes a help engine 711, an 
offline query log 712, an online query log 713, a help file(s) 714, and a link 
queries component 715. The help engine receives the messages with the failed 
user queries and application identification and stores them in the offline query log. 
The help engine may also function as a web-based help system to provide help 
information to the users via web pages. The web-based help system may provide 
more detailed help information than is available via an application's local help file. 
The web-based help system may track user queries that are submitted to it and 
then store them in the offline query log along with an application identification and 
an indication of whether the user queries failed or not. The failure rate of a user 
query can be used by a developer in deciding how to update a help file. For 
example, if the same failed user query was reported 10 times by user computers, 
then a developer might decide to link that failed user query to a certain topic. If, 
however, the online query log indicates that that user query failed only 1% of the 
time, then the developer might decide not to link the user query because the 
users usually find the topics relevant. If the online query log also identifies a topic 

(41826-8005-USOOOO/SL040360.282] -9- 2/17/04 



selected by the user when a query was successful, then a developer could use 
that information to help decide which topics are relevant to a user query. For 
example, if users indicate a user query fails 50% of the time and if all the times 
that user query was successful the users selected the same topic, then a 
developer may want to link that user query to that topic so that the topic will be 
ordered higher (i.e., deemed to have greater relevance) in the list of topics that 
are identified. The link queries component is used by a developer to generate or 
modify a help file. The link queries component provides the user interface of 
Figure 5 using the offline query log to modify a help file. The link queries 
component allows the developer to specify the application whose help file is to be 
modified. 

[0028] The user computer systems and the generation system may include a 

central processing unit, memory, input devices (e.g., keyboard and pointing 
devices), output devices (e.g., display devices), and storage devices (e.g., disk 
drives). The memory and storage devices are computer-readable media that may 
contain instructions that implement the generation system. In addition, the data 
structures and message structures may be stored or transmitted via a data 
transmission medium, such as a signal on a communications link. Various 
communications links may be used, such as the Internet, a local area network, a 
wide area network, or a point-to-point dial-up connection. The user computer 
systems can be any type of computing device, such as a personal computer, cell 
phone, personal digital assistant, and so on. 

[0029] Figure 7 illustrates an example of a suitable operating environment in 

which the generation system may be implemented. The operating environment is 
only one example of a suitable operating environment and is not intended to 
suggest any limitation as to the scope of use or functionality of the generation 
system. Other well-known computing systems, environments, and/or 
configurations that may be suitable for use with the generation system include 
personal computers, server computers, hand-held or laptop devices, 
multiprocessor systems, microprocessor-based systems, programmable consumer 
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electronics, network PCs, minicomputers, mainframe computers, distributed 
computing environments that include any of the above systems or devices, and 
the like. 

[0030] The generation system may be described in the general context of 

computer-executable instructions, such as program modules, executed by one or 
more computers or other devices. Generally, program modules include routines, 
programs, objects, components, data structures, and so on that perform particular 
tasks or implement particular abstract data types. Typically, the functionality of 
the program modules may be combined or distributed as desired in various 
embodiments. 

[0031] Figure 8 is a flow diagram illustrating the processing of the help file 

generation system when it receives a failed user query in one embodiment. In 
block 801, the component may normalize the user query by, for example, 
removing capitalization and noise words. In block 802, the component attempts to 
find a matching user query in the offline query log. In decision block 803, if a 
matching user query is found, then the component continues at block 804, else 
the component continues at block 805. In block 804, the component updates a 
count of the number of times the matching user query was reported as having 
failed and then completes. The count may be used by a developer in assessing 
whether the failure of that user query is a common occurrence. In block 805, the 
component adds the user query to the offline query log and then completes. 

[0032] Figure 9 is a flow diagram illustrating the processing of the link query 

component in one embodiment. This component may be invoked after the user 
interface of Figure 5 is displayed. The component Is passed the selected topic 
and application. The component loops processing each input of the developer 
until the developer specifies that the linking is complete. The component then 
updates the help file for the application. In block 901 , the component waits for 
developer input. In decision block 902, if the developer input is to find user 
queries, then the component continues at block 903, else the component 
continues at block 904. In block 903, the component updates the user queries 
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area with a list of the user queries that match the developer's search criteria and 
then loops, to block 901 to wait for the next developer input. In decision block 
904, if the developer input is to add the selected user queries, then the 
component continues at block 905, else the component continues at block 906. In 
block 905, the component adds the selected user queries to the linked queries list 
and then loops to block 901 to wait for developer input. In decision block 906, if 
the developer input is to delete the selected user queries, then the component 
continues at block 907, else the component continues at block 908. In block 907, 
the component removes the selected user queries from the linked queries list and 
then loops to block 901 to wait for developer input. In decision block 908, if the 
developer input is complete, then the component continues at block 909, else the 
component identifies and processes the developer input. In block 909, the 
component updates the help file to add metadata associated with the linked 
queries to the selected topic and then completes. 

[0033] Figure 10 is a flow diagram illustrating the processing of the update user 

queries component in one embodiment. The component is passed an indication 
of a search criteria and an application criteria. In block 1001, the component 
searches the offline query log for user queries that match the search criteria and 
application criteria. In block 1002, the component displays the matching user 
queries in the user queries area and then returns. 

[0034] From the foregoing, it will be appreciated that specific embodiments of the 

invention have been described herein for purposes of illustration, but that various 
modifications may be made without deviating from the spirit and scope of the 
invention. One skilled in the art will appreciate that various techniques may be 
used to determine whether a user query failed. For example, a user query that 
identifies no topics or only identifies topics below a certain relevance level may be 
deemed to have failed. Also, if the user does not select any of the identified 
topics, then the user query may be deemed to have failed. More generally, a 
failed user query can refer to any user query that is provided to a help file system 
for the purpose of modifying the help file. Moreover, indexes can be augmented 
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based on the failed user queries by automatically indexing based on the metadata 
of the entries or by a developer manually adding keywords derived from the linked 
user queries. In this way, the keywords can, over time, migrate to those terms 
actually being used by users, rather than to those terms that developers only think 
may be helpful to users. Accordingly, the invention is not limited except as by the 
appended claims. 
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